MODIFIED BACILLUS THURINGIENSIS
INSECTICIDAL-CRYSTAL PROTEIN GENES AND THEIR
EXPRESSION IN PLANT CELLS

This invention provides a modified Bacillus
thuringiensis ("Bt") gene (the "modified Bt ICP gene")
encoding all or an insecticidally-effective portion of
a Bt insecticidal crystal protein ("ICP"). A plant
transformed with the modified Bt ICP gene can show
higher expression levels of the encoded ICP and
improved insect-resistance.

Background of the Invention 

Plant genetic engineering technology has made 
significant progress during the last 10 years. It has 
become possible to introduce stably foreign genes into 
plants. This has provided exciting opportunities for 
modern agriculture. Derivatives of the Ti-plasmid of 
the plant pathogen, Agrobacterium tumefaciens, have 
proven to be efficient and highly versatile vehicles 
for the introduction of foreign genes into plants and 
plant cells. In addition, a variety of free DNA 
delivery methods, such as electroporation, 
microinjection, pollen-mediated gene transfer and 
particle gun technology, have been developed for the 
same purpose. 

The major aim of plant transformations by genetic 
engineering has been crop improvement. In an initial 
phase, research has been focused on the engineering 
into plants of useful traits such as insect-resistance. 
In this respect, progress in engineering insect
resistance in transgenic plants has been obtained
through the use of genes, encoding ICPs, from Bt
strains (Vaeck et al., 1987). A Bt strain is a
spore-forming gram-positive bacterium that produces a
parasporal crystal which is composed of crystal
proteins which are specifically toxic against insect
larvae. Bt ICPs possess a specific insecticidal
spectrum and display no toxicity towards other animals
and humans (Gasser and Fraley, 1989). Therefore, the Bt
ICP genes are highly suited for plant engineering
purposes.

For more than 20 years, Bt crystal spore 
preparations have been used as biological insecticides. 
The commercial use of Bt sprays has however been 
limited by high production costs and the instability of 
crystal proteins when exposed in the field (Vaeck et 
al., 1987). The heterogeneity of Bt strains has been 
well documented. Strains active against Lepidoptera 
(Dulmage et al., 1981), Diptera (Goldberg and Margalit, 
1977) and Coleoptera (Krieg et al., 1983) have been 
described. 

Bt strains produce endogenous crystals upon 
sporulation. Upon ingestion by insect larvae, the 
crystals are solubilized in the alkaline environment of 
the insect midgut giving rise to a protoxin which is 
subsequently proteolytically converted into a toxic 
core fragment or toxin of 60-70 kDa. The toxin causes 
cytolysis of the epithelial midgut cells. The 
specificity of Bt ICPs can be determined by their 
interaction with high-affinity binding sites present on 
insects' midgut epithelia.

The identification of Bt ICPs and the cloning and 
sequencing of Bt ICP genes has been reviewed by Hofte 
and Whiteley (1989). The Bt ICP genes share a number of 
common properties. They generally encode insecticidal 
proteins of 130 kDa to 140 kDa or of about 70 kDa, 
which contain toxic fragments of 60 ± 10 kDa (Hofte and 
Whiteley, 1989). The Bt ICP genes have been classified
into four major groups according to both their
structural similarities and insecticidal spectra (Hofte
and Whiteley, 1989): Lepidoptera-specific (CryI),
Lepidoptera- and Diptera-specific (CryII),
Coleoptera-specific (CryIII) and Diptera-specific (CryIV) genes.
The Lepidoptera-specific genes (CryI) all encode
130-140 kDa proteins. These proteins are generally
synthesized as protoxins. The toxic domain is localized
in the N-terminal half of the protoxin. Deletion
analysis of several CryI genes confirms that 3' portions
of the protoxins are not absolutely required for toxic
activity (Schnepf et al., 1989). CryII genes encode 65
kDa proteins (Widner and Whiteley, 1985). The CryIIA
proteins are toxic against both Lepidoptera and Diptera
while the CryIIB proteins are toxic only to
Lepidopteran insects. The Coleoptera-specific genes
(CryIII) generally encode proteins with a molecular
weight of about 70 kDa (Whiteley and Hofte, 1989). The
corresponding gene (cryIIIA) expressed in E. coli
directs the synthesis of a 72 kDa protein which is
toxic for the Colorado potato beetle. This 72 kDa
protein is processed to a 66 kDa protein by
spore-associated bacterial proteases which remove the first
57 N-terminal amino acids (McPherson et al., 1988).
Deletion analysis demonstrated that this type of gene
cannot be truncated at its 3'-end without the loss of
toxic activity (Hofte and Whiteley, 1989). Recently, an
anti-coleopteran strain, which produces a 130 kDa
protein, has also been described (European patent
application ("EPA") 89400428.2). The CryIV class of
crystal protein genes is composed of a heterogeneous
group of Diptera-specific crystal protein genes (Hofte
and Whiteley, 1989).

The feasibility of generating insect-resistant
transgenic crops by using Bt ICPs has been
demonstrated (Vaeck et al., 1987; Fischoff et al.,
1987 and Barton et al., 1987). Transgenic plants offer
an attractive alternative and provide an entirely new
approach to insect control in agriculture which is at
the same time safe, environmentally attractive and
cost-effective (Meeusen and Warren, 1989). Successful
insect control has been observed under field conditions
(Delannay et al., 1989; Meeusen and Warren, 1989).

In all cases, Agrobacterium-mediated gene transfer
has been used to express chimaeric Bt ICP genes in
plants (Vaeck et al., 1987; Barton et al., 1987;
Fischoff et al., 1987). Bt ICP genes were placed under
the control of a strong promoter capable of directing
gene expression in plant cells. It is, however,
remarkable that expression levels in plant cells were
high enough to obtain insect-killing levels of Bt
ICP only when truncated genes were used (Vaeck et al.,
1987; Barton et al., 1987). None of the transgenic
plants containing a full-length Bt ICP gene produced
insect-killing activity. Moreover, Barton et al. (1987)
showed that tobacco calli transformed with the entire
Bt ICP coding sequence became necrotic and died. These
results indicate that the Bt ICP gene presents unusual
problems that must be overcome to obtain significant
levels of expression in plants. Even when using a
truncated Bt ICP gene for plant transformation, the
steady state levels of Bt ICP mRNA obtained in
transgenic plants are very low relative to levels
produced by both an adjacent NPT II-gene, used as a
marker, and by other chimeric genes (Barton et al.,
1987; Vaeck et al., 1987). Moreover, the Bt ICP mRNA
cannot be detected by northern blot analysis. Similar
observations were made by Fischoff et al. (1987); they
reported that the level of Bt ICP mRNA was much lower
than expected for a chimeric gene expressed from the 
CaMV35S promoter. In other words, the cytoplasmic
accumulation of the Bt mRNA, and consequently the
synthesis, the accumulation and thereby the expression 
of the Bt ICP protein in plant cells, are extremely 
inefficient. By contrast, in microorganisms, it has 
been shown that truncated Bt ICP genes are less 
favorable than full-length genes (Adang et al., 1985), 
indicating that the inefficient expression is solely 
related to the heterologous expression of Bt ICP genes 
in plants. 

The problem of obtaining significant Bt ICP 
expression levels in plant cells seems to be inherent 
and intrinsic to the Bt ICP genes. Furthermore, the 
relatively low and poor expression levels obtained in
plants appear to be a common phenomenon for all Bt ICP
genes.

It is known that there are six steps at which gene 
expression can be controlled in eucaryotes (Darnell, 
1982): 

1) Transcriptional control 

2) RNA processing control 

3) RNA transport control 

4) mRNA degradation control 

5) Translational control

6) Protein activity control

For all genes, transcriptional control is
considered to be of paramount importance (The Molecular
Biology of the Cell, 1989).

In European patent publications ("EP") 385,962 and 
359,472, efforts to modify the codon usage of Bt ICP 
genes to improve their expression in plant cells have 
been reported. However, wholesale (i.e., non-selective)
changes in codon usage can introduce cryptic regulatory
signals in a gene, thereby causing problems in one or
more of the six steps mentioned above for gene
expression, and thus inhibiting or interfering with
transcription and/or translation of the modified
foreign gene in plant cells. For example, changes in
codon usage can cause differential rates of mRNA
production, producing instability in the mRNA so
produced (e.g., by exposure of regions of the mRNA,
unprotected by ribosomes, to attack and degradation by
cytoplasmic enzymes). Changes in codon usage also can
inadvertently cause inhibition or termination of RNA
polymerase II elongation on the so-modified gene.

Summary of the Invention

In accordance with this invention is provided a
process for modifying a foreign gene, particularly a Bt
ICP gene, whose level and/or rate of expression in
plant cells, transformed with the gene, is limited by
the rate and/or level of nuclear production of an mRNA
encoded by the gene; the process comprises the step of
changing adenine and thymine sequences to corresponding
guanine and cytosine sequences encoding the same amino
acids in a plurality of translational codons of the
gene that would otherwise directly or indirectly cause
a nuclear event which would negatively control (i.e.,
inhibit or interfere with) transcription, nuclear
accumulation and/or nuclear export of the mRNA,
particularly transcription, quite particularly
elongation of transcription by RNA polymerase II of the
plant cells. Preferably, the adenine and thymine
sequences are changed to cytosine and guanine sequences
in translational codons of at least one region of the
gene which, during transcription, would otherwise have
thereon a relatively low percentage of RNA polymerase
II as compared to another adjacent upstream (i.e., 5')
region of the gene.



Also in accordance with this invention is provided 
the modified Bt ICP gene resulting from the process. 

Further in accordance with this invention, a 
process is provided for improving the resistance of a 
plant against insect pests by transforming the plant 
cell genome with at least one modified Bt ICP gene. 

This invention also relates to a chimaeric gene 
that can be used to transform plant cells and that 
contains the following operably-linked DNA fragments in
the same transcriptional unit: 

1) the modified Bt ICP gene; 

2) a promoter suitable for directing transcription 
of the modified Bt ICP gene in the plant cells; 
and 

3) suitable transcript 3' end formation and 
polyadenylation signals for expressing the 
modified Bt ICP gene in the plant cells.

This invention further relates to: 

- a cell of a plant, the nuclear genome of which 
has been transformed to contain, preferably stably 
integrated therein, the modified Bt ICP gene, 
particularly the chimaeric gene; 

- cell cultures consisting of the plant cell; 

- a plant which is regenerated from the 
transformed plant cell or is produced from the 
so-regenerated plant, the genome of which contains 
the modified Bt ICP gene, particularly the 
chimaeric gene, and which shows improved 
resistance to insect pests; 

- seeds of the plant; and 

- a vector for stably transforming the nuclear 
genome of plant cells with the modified Bt ICP 
gene, particularly the chimaeric gene.

Detailed Description of the Invention

As used herein, "Bt ICP" should be understood as
an intact protein or a part thereof which has
insecticidal activity and which can be produced in
nature by B. thuringiensis. A Bt ICP can be a protoxin,
as well as an active toxin or other insecticidal
truncated part of a protoxin which need not be
crystalline and which need not be a naturally occurring
protein. An example of a Bt ICP is a Bt2 insecticidal
crystal protein (Hofte et al., 1986), as well as its
insecticidally effective parts which are truncated at
its C- and/or N-terminal ends towards its trypsin
cleavage site(s) and preferably having a molecular
weight of 60-80 kDa. Other examples of Bt ICPs are:
Bt2, Bt3, Bt4, Bt13, Bt14, Bt15, Bt18, Bt21, Bt22,
Bt73, Bt208, Bt245, BtI260 and BtI109P as disclosed in
PCT publications WO 90/15139 and WO 90/09445, in Hofte
and Whiteley (1989) and in EPA 90403724.9.

As used herein, "protoxin" should be understood as 
the primary translation product of a full-length gene 
encoding a Bt ICP. 

As used herein, "toxin" or "active toxin" or 
"toxic core" should all be understood as a part of a 
protoxin which can be obtained by protease (e.g., by 
trypsin) cleavage and has insecticidal activity. 

As used herein, "truncated Bt gene" should be 
understood as a fragment of a full-length Bt gene which 
still encodes at least the toxic part of the Bt ICP, 
preferentially the toxin. 

As used herein, "modified Bt ICP gene" should be
understood as a DNA sequence which encodes a Bt ICP,
and in which the content of adenine ("A") and thymine
("T") has been changed to guanine ("G") and cytosine
("C") in codons, preferably at least 3, in at least one
region of the DNA sequence without affecting the
original amino acid sequence of the Bt ICP. Preferably
in at least two regions, especially in at least three
regions, of the DNA sequence, the A and T content is
changed to G and C in at least 3 codons. For regions
downstream of the translation initiation site of the
DNA sequence, it is preferred that the A-T content of
at least about 10 codons, particularly at least about
33 codons, be changed to G-C.

By "region" of a modified Bt ICP gene is meant any 
sequence encoding at least three translational codons 
which affect expression of the gene in plants.
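
As a hedged illustration of the definitions above, the following Python sketch replaces codons inside one chosen region with synonymous codons richer in G and C and confirms that the encoded amino acid sequence is unchanged. The function names (gc_count, translate, gc_enrich), the rule of always picking the synonym with the most G and C, and the toy sequence are assumptions made for illustration only; they are not taken from this specification, in which the regions to be modified are identified experimentally.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def gc_count(codon):
    """Number of G or C bases in a codon."""
    return sum(base in "GC" for base in codon)


def translate(seq):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def gc_enrich(seq, first_codon, last_codon):
    """Swap codons first_codon..last_codon (0-based, inclusive) for the
    synonymous codon with the most G/C. Hypothetical selection rule."""
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    changed = 0
    for idx in range(first_codon, last_codon + 1):
        codon = codons[idx]
        synonyms = [c for c, aa in CODON_TABLE.items()
                    if aa == CODON_TABLE[codon]]
        best = max(synonyms, key=gc_count)
        if gc_count(best) > gc_count(codon):
            codons[idx] = best
            changed += 1
    return "".join(codons), changed


# Toy coding sequence (not a Bt sequence): Met-Lys-Leu-Asp-Phe-stop.
toy = "ATGAAATTAGATTTTTAA"
modified, n_changed = gc_enrich(toy, 1, 4)
assert translate(modified) == translate(toy)  # protein is unchanged
print(modified, n_changed)
```

Restricting the change to an explicit codon range reflects the selective, region-by-region modification described above, as opposed to a wholesale change of codon usage over the entire gene.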

In accordance with this invention, it has been
shown by means of mRNA turn-over studies that the
expression pathway of a Bt ICP gene, such as bt2, bt14,
bt15 and bt18, is specifically inhibited at the nuclear
level in plant cells. In a further analysis, nuclei of
transgenic tobacco plants, i.e., N28-220 (Vaeck et
al., 1987), were used in a nuclear run-on assay to
determine the distribution and the relative efficiency
of RNA polymerase II complexes to initiate
transcription of chimaeric Bt ICP plant genes. In this
regard, the run-on assay has been used to determine
initially the relative efficiency of RNA polymerase II
complexes to initiate transcription of Bt ICP genes and
thereafter to determine the relative distribution and
migration efficiency of the RNA polymerase II complexes
on the Bt ICP genes.

N28-220 contains the bt884 fragment under
control of the TR2' promoter as a chimaeric gene.
Bt884 is a 5' fragment of the bt2 gene (Hofte et al.,
1986) up to codon 610 (Vaeck et al., 1987). Using
nuclear run-on analysis, isolated nuclei of N28-220
were incubated with highly labeled radioactive RNA
precursors, so that the RNA transcripts being
synthesized at the time became radioactively labeled.
The RNA polymerase II molecules caught in the act of
transcription in the cell continue elongating the same
RNA molecules in vitro.

The nuclear run-on assays of nuclei of N28-220
culture (non-induced cells and induced cells, TR1'-neo,
TR2'-bt884) revealed that transcription from the TR1'
and TR2' promoters is about equally efficient. This
implies that the low Bt ICP (i.e., Bt884) expression
levels are not due to a specifically reduced
transcriptional activity of the TR2' promoter. However,
nuclear run-on analysis with N28-220 nuclei indicated
that transcription elongation of the nascent Bt ICP
mRNA is impaired somewhere between 700 and 1000
nucleotides downstream of the start of transcription.
This means that RNA polymerase II is not able to
transcribe the Bt ICP coding sequence with 100%
efficiency. Filter binding assays using labeled Bt DNA
fragments spanning this region and protein extract
prepared from tobacco nuclei reveal that this DNA
region undergoes specific interactions with proteins
present in nuclei. These interactions are the prime
candidates for causing or affecting the impaired elongation
of transcription by RNA polymerase II through this
region. By modification of this region to abolish
specific protein binding, Bt ICP expression levels will
increase. However, other mechanisms responsible for
impaired elongation in this region cannot be excluded.

Further in accordance with this invention,
sequences within the coding region involved in negative
control of cytoplasmic Bt ICP mRNA levels have been
identified by deletion analysis. To this end, 24
deletion derivatives of pVE36 have been constructed.
Three main types of deletion mutants have been
constructed (see Fig. 3):

- 5' end deletions

- 3' end deletions

- internal deletions.

The expression of a mutant hybrid bt2-neo gene
(encoding a fusion protein of Bt2 (Hofte et al., 1986)
and NPTII) has been studied by means of transient
expression experiments using the cat gene as a
reference. To this end, the neo mRNA levels were
measured in relation to cat mRNA levels in RNA extracts
of SR1 protoplasts. The ratio between the neo and cat
mRNA level was used to quantify on a relative basis the
nptII transcript (i.e., mRNA) levels produced by the
different constructions. These experiments show that
progressive deletions of the carboxy-terminal (i.e.,
3') part or the amino-terminal (i.e., 5') part of the
Bt ICP coding sequence result in a gradual increase of
the nptII transcript level. Furthermore, since the
changes in transcript levels are not very abrupt, these
results suggest that the low transcript levels produced
by Bt ICP genes are not controlled by a single factor.
Nevertheless, individual modifications of bt2 coding
sequence can significantly reduce the interference
and/or inhibition of the expression of the mRNA encoded
by Bt ICP genes in plant cells at the level of
transcript elongation, nuclear accumulation and nuclear
export. The modification(s) may also affect cytoplasmic
regulation and metabolism of such mRNAs and their
translation.
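
The relative quantification described above, in which the neo signal is divided by the cat reference signal for each construct, can be sketched as follows. This is a minimal illustration under stated assumptions: the function name, the construct labels and every signal value are hypothetical and are not measurements from this specification.

```python
def relative_transcript_levels(signals, reference_construct):
    """signals maps a construct name to (neo_signal, cat_signal).

    Each neo signal is divided by the co-measured cat signal, and the
    resulting ratios are expressed relative to the reference construct.
    """
    ratios = {name: neo / cat for name, (neo, cat) in signals.items()}
    ref = ratios[reference_construct]
    return {name: ratio / ref for name, ratio in ratios.items()}


# Hypothetical densitometric signals (arbitrary units), for illustration only.
example_signals = {
    "parental": (10.0, 100.0),
    "deletion_1": (30.0, 100.0),
    "deletion_2": (45.0, 90.0),
}
levels = relative_transcript_levels(example_signals, "parental")
for name, level in levels.items():
    print(f"{name}: {level:.2f}")
```

Dividing by the cat signal corrects for differences in the efficiency of the transient assay between samples, so that constructs can be compared on a relative basis.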

Deletion analysis clearly indicates that several
internal sequences, located within the Bt ICP coding
region, might be involved in the negative regulation of
the Bt ICP expression. By way of example, a 326 bp
region (fig. 6b) was identified in the bt2 gene that is
involved in the negative control of Bt ICP expression
and that is located between nucleotide position 674 and
nucleotide position 1000, particularly a 268 bp region
between nucleotide positions 733 and 1000, quite
particularly a 29 bp region between nucleotide
positions 765 and 794 which carries two perfect CCAAT
boxes which are known to be able to cause a reduction
in elongation efficiency and termination of
transcription by RNA polymerase II in animal systems
(Connelly and Manley, 1989). This internal gene
fragment or inhibitory zone may itself comprise a
plurality of inhibitory zones which reduce Bt ICP
expression levels or which interact directly or
indirectly with other zones to inhibit or interfere
with expression. Codon usage of this inhibitory zone
has been modified in a second step by substituting A -
T with G - C without affecting the amino acid sequence.
In this regard, this internal 326 bp fragment (fig. 6b)
has been replaced with a modified Bt ICP fragment of
this invention containing 59 modified codons. The
effect of such modification of this inhibitory zone on
Bt ICP expression has been analyzed both in transient
and stable plant transformants. The results show that
such modification of codon usage causes a significant
increase of Bt ICP expression levels and hence improved
insect-resistance.
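
A simple way to locate candidate CCAAT boxes of the kind mentioned above is to scan both strands of a coding sequence for the motif. The sketch below is illustrative only: the function names and the toy sequences are assumptions, and the toy sequences are not the bt2 sequence.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_motif(seq, motif="CCAAT"):
    """Return (1-based position, strand) for every occurrence of the motif.

    '+' means the motif reads 5'->3' on the given strand; '-' means it
    reads 5'->3' on the complementary strand.
    """
    seq = seq.upper()
    rc_motif = reverse_complement(motif)
    hits = []
    for i in range(len(seq) - len(motif) + 1):
        window = seq[i:i + len(motif)]
        if window == motif:
            hits.append((i + 1, "+"))
        elif window == rc_motif:
            hits.append((i + 1, "-"))
    return hits


# Toy sequences, for illustration only.
print(find_motif("GGCCAATTTACCAATGG"))
print(find_motif("TTATTGGTT"))
```

A scan of this kind only flags sequence matches; whether a given match actually impairs elongation has to be established experimentally, as in the deletion and run-on analyses described here.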

In addition, N-terminal deletion mutants of the
bt2 gene have been made by deleting the first
N-terminal 28 amino acids (Hofte et al., 1986). It is
known for the bt2 gene that the first 28 codons can be
deleted without loss of toxicity (Hofte et al., 1986;
Vaeck et al., 1987). Also, codon usage for three
codons, 29 to 31, has been changed in accordance with
this invention by replacing A - T with G - C without
affecting the amino acid sequence. Furthermore, an
optimal translation initiation (ATG) site was created
based on the consensus sequence of Joshi (1987) as
shown in fig. 6a. Plants transformed with this modified
Bt ICP gene show significantly higher Bt ICP expression
levels.

In accordance with this invention, all or part of 
a modified Bt ICP gene of the invention can be stably 
inserted in a conventional manner into the nuclear 
genome of a plant cell, and the so-transformed plant 
cell can be used to produce a transgenic plant showing 
improved expression of the Bt ICP gene. In this regard, 
a disarmed Ti-plasmid, containing the modified Bt ICP
gene, in Agrobacterium (e.g., A. tumefaciens) can be
used to transform a plant cell using the procedures
described, for example, in EP 116,718 and EP 270,822,
PCT publication 84/02913, EPA 87400544.0 and Gould et
al. (1991) (which are incorporated herein by
reference). Preferred Ti-plasmid vectors contain the
foreign DNA sequence between the border sequences, or at
least located to the left of the right border sequence,
of the T-DNA of the Ti-plasmid. Of course, other types
of vectors can be used to transform the plant cell, 
using procedures such as direct gene transfer (as 
described, for example, in EP 233,247), pollen mediated 
transformation (as described, for example, in EP 
270,356, PCT publication WO 85/01856, and US patent 
4,684,611), plant RNA virus-mediated transformation (as 
described, for example, in EP 67,553 and US patent 
4,407,956), liposome-mediated transformation (as 
described, for example, in US patent 4,536,475) and 
other methods such as the recently described methods 
for transforming certain lines of corn (Fromm et al., 
1990; Gordon-Kamm et al., 1990). 

Preferably, the modified Bt ICP gene is inserted
in a plant genome downstream of, and under the control
of, a promoter which can direct the expression of the
gene in the plant cells. Preferred promoters include,
but are not limited to, the strong constitutive 35S
promoter (Odell et al., 1985) of cauliflower mosaic
virus; 35S promoters have been obtained from different
isolates (Hull and Howell, Virology 86, 482-493
(1987)). Other preferred promoters include the TR1'
promoter and the TR2' promoter (Velten et al., 1984).
Alternatively, a promoter can be utilized which is not
constitutive but rather is specific for one or more
tissues or organs. For example, the modified Bt ICP
gene can be selectively expressed in the green tissues
of a plant by placing the gene under the control of a
light-inducible promoter such as the promoter of the
ribulose-1,5-bisphosphate carboxylase small subunit
gene as described in EPA 86300291.1. Another
alternative is to use a promoter whose expression is
inducible by temperature or chemical factors.

It is also preferred that the modified Bt ICP gene
be inserted upstream of suitable 3' transcription
regulation signals (i.e., transcript 3' end formation
and polyadenylation signals) such as the 3'
untranslated end of the octopine synthase gene (Gielen
et al., 1984) or T-DNA gene 7 (Velten and Schell,
1985).

The resulting transformed plant of this invention
shows improved expression of the modified Bt ICP gene
and hence is characterized by the production of high
levels of Bt ICP. Such a plant can be used in a
conventional breeding scheme to produce more
transformed plants with the same improved
insect-resistance characteristics or to introduce the modified
Bt ICP gene into other varieties of the same or related
plant species. Seeds, which are obtained from the
transformed plants, contain the modified Bt ICP gene as
a stable genomic insert.

Furthermore, at least two modified Bt ICP genes,
coding for two non-competitively binding
anti-Lepidopteran or anti-Coleopteran Bt ICPs, can be cloned
into a plant expression vector (EPA 89401499.2). Plants
transformed with such a vector are characterized by the
simultaneous expression of at least two modified Bt ICP
genes. The resulting transgenic plant is particularly
useful to prevent or delay development of resistance to
Bt ICP of insects feeding on the plant.

The following Examples illustrate the invention 
and are not intended to limit its scope. The Figures, 
referred to in the Examples, are as follows: 

Fig. 1 — Comparison of the transcription initiation
frequency of RNA polymerase II complexes in nuclei of
N28-220. Hybridisation efficiencies of labeled nptII
mRNA and Bt ICP mRNA with their complementary DNA
counterparts present on a Southern blot were compared.
DNA fragments were obtained from a digest of plasmid
pGSH163. A schematic view of the region is given. The
lengths of the fragments blotted on Hybond-N filter
(1), the homologous genes on plasmid pGSH163 (2), and
the densitometric values (3) are as follows:

Digest:          1       2         3

BamHI/HindIII    2358    neo       12386
                 1695    bt2       6565
                 154     bt2
                 6250    vector

Fig. 2a — Determination of the distribution of the RNA
polymerase II complexes on the Bt ICP coding sequence
in nuclei of N28-220. The hybridisation of labeled RNA
prepared by nuclear run-on with DNA fragments of the Bt
ICP coding sequence was quantitated. The restriction
fragments and scanning values are given in the table
and figure. The scanning value is proportional to "X",
the size of the DNA fragment and the number of UTPs per RNA
fragment hybridising. "X" is directly proportional to
the number of RNA polymerases passing through the DNA
fragment; "X" is proportional to the scanning value
divided by the number of UTPs. The X values of the
different restriction fragments are shown in the
figure. In this regard, conversion of the different
densitometric values into relative hybridisation
efficiencies by normalising the values to the number of
dATPs present in the DNA fragment, complementary to the
hybridising RNA, generates the value "X". "X" is a
relative measure of the number and the length of the
extension of the transcripts. "X" thus reflects the
number of RNA polymerases transcribing a specific DNA
sequence and their elongation rate. For the digests of
plasmid DNA of plant vector pGSH163 present on the
Southern blots, the lengths of the fragments blotted on
Hybond-N filter (1), the homologous genes on plasmid
pGSH163 (2) and the densitometric values (3) are as
follows:

Digest:          1       2         3

BamHI/EcoRI      8877    neo       15333
                 726     bt2(2)    2926
                 583     bt2(3)    635
                 271     bt2(1)

BamHI/EcoRV      8887    neo       15182
                 84      bt2       2466
                 729     bt2       1102

BamHI/HindIII    6250
                 2358    neo       12386
                 1695    bt2       6565
                 154

BamHI/SacI       8053    neo       14194
                 1353    bt2(1)    4572
                 1051    bt2(2)    615

XmnI             (1): 4973, 2107, 1401, 729, 628, 305, 188, 120
                 (2): neo, bt2(3), bt2(2), bt2(4), bt2(5), bt2(1)
                 (3): 13219, 736, 1817


Fig. 2b — Schematic view of nine bt884 DNA fragments
that were inserted into the polylinker of M13 vectors,
MP18 and MP19 (Yanisch-Perron et al., 1985). The Bt ICP
coding sequence is shown from AUG to 1600 nucleotides
downstream. The relevant restriction sites and sizes of
the DNA fragments are indicated. The nucleotide
numbering is relative to the AUG. The subclones were
named pJD71, pJD72, pJD73, etc. (to pJD79), as
indicated. The inserts were oriented into the M13
vector such that single-stranded M13 carried the
fragments of the Bt ICP coding sequence in an
anti-sense orientation.

Fig. 2c — Schematic representation of three nuclear
run-on analyses with N28-220 nuclei as described by Cox
and Goldberg (1988). Assays were performed for periods
of 5, 10 and 30 minutes. The labeled nuclear RNA was
allowed to hybridize with 5 µg of single-stranded
pJD71-pJD79 and MP18 DNA, which were immobilised on
nylon membranes. The membranes were autoradiographed,
and densitometry values were obtained by scanning the
autoradiographs. The abscissa shows the nucleotide
position relative to the AUG of the Bt (i.e., bt2)
coding sequence. The center of each of the
single-stranded Bt DNA fragments is indicated in the graph.
The ordinate gives the relative hybridisation signal
for each fragment corrected for the number of dATPs in
the fragment and adjusted to 100% for the value of
pJD71 for each of the three incubation periods. All
values are corrected for non-specific hybridisation to
single-stranded MP18 DNA. The relative values are a
measure for the reactivation of Bt mRNA synthesis by
RNA polymerase II. The assay does not distinguish
between the number of mRNA extensions and the length of
mRNA extensions.
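
The normalisation described for Fig. 2c can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name is hypothetical, the correction for non-specific hybridisation is assumed to be a simple subtraction of the MP18 signal, and all numerical values are invented for the example rather than taken from the figures.

```python
def normalise_run_on(raw, n_datp, background, reference="pJD71"):
    """Convert raw densitometry values into relative hybridisation signals.

    raw        : fragment name -> raw scanning value
    n_datp     : fragment name -> number of dATPs in the fragment
    background : signal of the empty MP18 vector (assumed to be subtracted)
    reference  : fragment whose corrected signal is set to 100%
    """
    per_datp = {name: (raw[name] - background) / n_datp[name] for name in raw}
    ref = per_datp[reference]
    return {name: 100.0 * value / ref for name, value in per_datp.items()}


# Hypothetical values for one incubation period, for illustration only.
raw_values = {"pJD71": 1100.0, "pJD72": 850.0, "pJD73": 400.0}
datp_counts = {"pJD71": 50, "pJD72": 50, "pJD73": 60}
relative = normalise_run_on(raw_values, datp_counts, background=100.0)
for name, value in relative.items():
    print(f"{name}: {value:.1f}%")
```

Expressing every fragment relative to pJD71 makes the three incubation periods directly comparable, since each series starts at 100% for the most upstream fragment.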

Fig. 3 — Construction of deletion mutants of the
bt860-neo gene to measure the effect on cytoplasmic Bt 
ICP mRNA levels. The parental vector pVE36 is shown. 
The following deletion mutants were generated: 

1. pJD50: pJD50 was derived from pVE36 by digesting
with BamHI and SphI. The 5' and 3'
protruding ends were filled in with Klenow
DNA polymerase I enzyme. The treated DNA
was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

2. pJD51: pJD51 was derived from pVE36 by digesting
with SpeI and SphI. The 5' and 3'
protruding ends were filled in with Klenow
DNA polymerase I enzyme. The treated DNA
was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

3. pJD52: pJD52 was derived from pVE36 by digesting
with EcoRV and SphI. The 5' and 3'
protruding ends were filled in with Klenow
DNA polymerase I enzyme. The treated DNA
was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

4. pJD53: pJD53 was derived from pVE36 by digesting
with XcaI and SphI. The 3' protruding ends
were filled in with Klenow DNA polymerase
I enzyme. The treated DNA was ligated and
then used to transform MC1061 cells.
Transformants were selected for amp^r
phenotype.

5. pJD54: pJD54 was derived from pVE36 by digesting
with AflII and SphI. The 5' and 3'
protruding ends were filled in with Klenow
DNA polymerase I enzyme. The treated DNA
was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

6. pJD55: pJD55 was derived from pVE36 by digesting
with ClaI and SphI. The 5' and 3'
protruding ends were filled in with Klenow
DNA polymerase I enzyme. The treated DNA
was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

7. pJD56: pJD56 was derived from pVE36 by digesting
with XhoI and SphI. The 5' and 3'
protruding ends were filled in with Klenow
DNA polymerase I enzyme. The treated DNA
was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

8. pJD57: pJD57 was derived from pVE36 by digesting
with AflII and BamHI. The 5' and 3'
protruding ends were filled in with Klenow
DNA polymerase I enzyme. The treated DNA
was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

9. pJD58: pJD58 was derived from pVE36 by digesting
with XcaI and BamHI. The 5' protruding
ends were filled in with Klenow DNA
polymerase I enzyme. The treated DNA was
ligated and then used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

10. pJD59: pJD59 was derived from pVE36 by digesting
with EcoRV and BamHI. The 5' protruding
ends were filled in with Klenow DNA
polymerase I enzyme. The treated DNA was
ligated and then used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

11. pJD60: pJD60 was derived from pVE36 by digesting
with SpeI and BamHI. The 5' protruding
ends were filled in with Klenow DNA
polymerase I enzyme. The treated DNA was
ligated and then used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

12. pJD61: pJD61 was derived from pJD50. pVE36 was
digested with XbaI and filled in with
Klenow polymerase I. pJD50 was linearized
with BamHI and filled in with Klenow
polymerase I. The 375 bp XbaI fragment of
pVE36 was ligated in the filled-in BamHI
site of pJD50. The ligation mixture was used to
transform MC1061 cells. Transformants were
selected for amp^r phenotype.

13. pJD62: pJD62 was derived from pJD50. pVE36 was
digested with XcaI and EcoRV. pJD50 was
linearized with BamHI and filled in with
Klenow polymerase I. The 367 bp XcaI-EcoRV
fragment of pVE36 was ligated into the
filled-in BamHI site of pJD50. The ligation
mixture was used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

14. pJD63: pJD63 was derived from pJD50. pVE36 was
digested with XcaI and EcoRV. pJD50 was
linearized with BamHI and filled in with
Klenow polymerase I. The 474 bp XcaI-EcoRV
fragment of pVE36 was ligated into the
filled-in BamHI site of pJD50. The ligation
mixture was used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

15. pJD64: pJD64 was derived from pJD50. pVE36 was
digested with EcoRI and EcoRV and filled
in with Klenow polymerase I. pJD50 was
linearized with BamHI and filled in with
Klenow polymerase I. The 458 bp EcoRI-EcoRV
fragment of pVE36 was ligated into the
filled-in BamHI site of pJD50. The ligation
mixture was used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

16. pJD65: pJD65 was derived from pJD50. pVE36 was
digested with EcoRI and XbaI and filled in
with Klenow polymerase I. pJD50 was
linearized with BamHI and filled in with
Klenow polymerase I. The 327 bp EcoRI-XbaI
fragment of pVE36 was ligated into the
filled-in BamHI site of pJD50. The ligation
mixture was used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

17. pJD66: pJD66 was derived from pJD50. pVE36 was
digested with SpeI and XcaI and filled in
with Klenow polymerase I. pJD50 was
linearized with BamHI and filled in with
Klenow polymerase I. The 1021 bp SpeI-XcaI
fragment of pVE36 was ligated into the
filled-in BamHI site of pJD50. The ligation
mixture was used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

18. pPS56D1: pPS56D1 was derived from pJD56 by
digesting with EcoRV. The treated DNA was
ligated and then used to transform MC1061
cells. Transformants were selected for
amp^r phenotype.

19. pPS56D2: pPS56D2 was derived from pJD56 by
digesting with XcaI and AflII. The 5'
protruding ends were filled in with Klenow
polymerase I. The treated DNA was ligated
and then used to transform MC1061 cells.
Transformants were selected for amp^r
phenotype.

20. pPS56D3: pPS56D3 was derived from pJD56 by
digesting with SpeI and EcoRV. The 5'
protruding ends were filled in with Klenow
polymerase I. The treated DNA was ligated
and then used to transform MC1061 cells.
Transformants were selected for amp^r
phenotype.

21. pPS56D4: pPS56D4 was derived from pJD56 by
digesting with XcaI and partially with
EcoRV. The treated DNA was ligated and
then used to transform MC1061 cells.
Transformants were selected for amp^r
phenotype.

22. pPS56D6: pPS56D6 was derived from pJD56 by
digesting with SpeI and partially with
EcoRV. The 5' protruding ends were filled
in with Klenow polymerase I. The treated
DNA was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

23. pPS56D7: pPS56D7 was derived from pJD56 by
digesting with SpeI and XcaI. The 5'
protruding ends were filled in with Klenow
polymerase I. The treated DNA was ligated
and then used to transform MC1061 cells.
Transformants were selected for amp^r
phenotype.

24. pPS56D8: pPS56D8 was derived from pPS56D2 by
digesting with SpeI and partially with
EcoRV. The 5' protruding ends were filled
in with Klenow polymerase I. The treated
DNA was ligated and then used to transform
MC1061 cells. Transformants were selected
for amp^r phenotype.

Fig. 4 — Effect of deletions in the Bt ICP coding 
sequence on cytoplasmic Bt ICP mRNA levels. The
cytoplasmic mRNA levels specified by the invariable cat
reference gene and the different Bt ICP deletion
mutants described in fig. 3 are listed in the table.
The measurements were converted into relative Bt ICP
mRNA abundances. Bt ICP and cat mRNA quantifications
were done as described by Cornelissen (1989). Total RNA
was slot blotted and hybridised with radioactively
labeled RNA complementary to the neo and cat coding
sequences. Values were quantitated with the aid of
calibration curves of cold cat and Bt ICP riboprobe
transcripts.

Fig. 5 — Relative transcript levels produced by the 
deletion derivatives of pVE36. 

Fig. 6a - Schematic presentation of the synthetic DNA
sequences used to introduce an N-terminal deletion and
a change of codons 29, 30 and 31 of the bt2 coding
sequence. The oligonucleotides were annealed according
to Engler et al. (1988) and cloned into the BstXI
restriction site of plasmid pVE36, yielding pPS027. The
7360 bp fragment of pPS027 was ligated to the 1177
bp ClaI restriction fragment of pVE36, yielding plasmid
pPS028. pPS028 is identical to pVE36 apart from the
N-terminal modification.

Fig. 6b - Schematic presentation of the synthetic DNA
sequences used to introduce an internal modification
into the bt2 coding sequence. The oligonucleotides were
annealed and ligated as described by Engler et al.
(1988) and the resulting concatemeric DNA fragment was
cut with the restriction enzymes XbaI and EcoRI to
release the modified 327 bp XbaI-EcoRI restriction
fragment. This fragment was ligated into the 3530 bp
EcoRI-XbaI fragment of pPS023, which is a pUC19
derivative (Yanisch-Perron et al., 1985) that carries
the 1533 bp AflII (filled in) BamHI fragment of pVE36
in the HindIII (filled in) BamHI site of pUC19,
resulting in plasmid pPS024. Plasmid pPS024 was
linearised by digestion with restriction enzyme XbaI
and the 375 bp XbaI restriction fragment of pPS023 was
introduced, resulting in pPS025. The 1177 bp ClaI
fragment of pPS025 was introduced in the 7360 bp ClaI
restriction fragment of pPS027, yielding pPS029. pPS029
is identical to pVE36 but carries both the
amino-terminal modification and the internal
modification of the Bt ICP coding sequence.

Fig. 6c — Nucleotide sequences 800 to 4000 of the 
plasmids pVE36 and pPS029. "x" denotes unknown
nucleotides.

Fig. 7 — Schematic presentation of the effect of the 
mutations on the AT content of the Bt ICP plant gene. 
The modified regions are indicated. 

Fig. 8a - Schematic presentation of the plasmid
constructions used in the transient expression assay. 
The relevant genes are indicated. 

Fig. 8b — Accumulation profiles of CAT (Neumann et 
al., 1987) and the modified Bt ICP (Engvall and Pesce,
1978) in a typical transient expression assay. 

Unless otherwise stated in the Examples, all 
procedures for making and manipulating recombinant DNA 
are carried out by the standardized procedures 
described in Sambrook et al., Molecular Cloning - A
Laboratory Manual, Cold Spring Harbor Laboratory
(1989).

Example 1. Determination of the Efficiency of
Transcription Initiation

The relative efficiency of RNA polymerase II
complexes to initiate transcription at chimaeric Bt ICP
plant genes was studied, using transgenic plant N28-220,
which is described by Vaeck et al. (1987) and contains
copies of the T-DNA of plasmid pGSH163. This T-DNA
carries the chimaeric plant genes PTR2-bt884-3'g7 and
PTR1-neo-3'ocs. Nuclei of 25 g of induced leaves of
N28-220 were prepared according to Cox and Goldberg
(1988) and stored at a temperature of -70°C.
This method causes the nascent precursor mRNA chains
and the RNA polymerase II complexes to halt while the
complexes remain associated with the DNA. A batch of
these nuclei was assayed for the ability to incorporate
radioactively labeled UTP as a measure of the
transcriptional viability of the nuclei (Cox and
Goldberg, 1988). This incorporation could be
successfully repressed by addition of α-amanitin to a
final concentration of 2 µg/ml. This shows that the UTP
incorporation was due to transcript elongation by RNA
polymerase II and that RNA synthesis on the protein
coding genes which are occupied by RNA polymerase II
can be reactivated under the appropriate experimental
conditions.

Batches of the nuclei of N28-220 were used to 
synthesize radioactively labeled RNA as described by 
Cox and Goldberg (1988). The radioactive RNA 
synthesized is a direct representation of the 
distribution of the RNA polymerase II complexes on the
DNA in the nuclei. As the DNA of N28-220 carries two 
genes which can be assayed, namely the chimaeric neo 
gene and the chimaeric Bt ICP gene, it is possible to 
compare the distribution of RNA polymerase II complexes 
on these two genes. To this end, the radioactive RNA 
was extracted from the nuclei according to Cox and 
Goldberg (1988) and used as a probe in a conventional 
Southern hybridisation. The Southern blot contained DNA 
fragments carrying the Bt ICP and neo coding sequences 
in a molar excess relative to the neo and Bt ICP RNA 
species present in the radioactive probe. A detailed 
description of the Southern blot is given in fig. 1.
The hybridisation experiment resulted in hybridisation
signals to both the neo and Bt ICP coding sequences
(fig. 1). Densitometric scanning showed that the
intensity of the hybridisation signal to the neo and Bt
ICP coding regions was nearly identical. This result
implies that the number of transcripts initiating from
the TR dual promoter is about the same in both
directions. In plant N28-220, the cytoplasmic neo
mRNA level is several orders of magnitude higher than
that of Bt ICP. This shows that the Bt ICP coding
sequence indeed negatively controls accumulation of
cytoplasmic Bt ICP mRNA, but that this phenomenon is
not due to a dominant negative effect on transcription
initiation of the chimaeric Bt ICP plant gene.

Example 2. Transcription Elongation

The relative distribution of RNA polymerase II
complexes on the Bt ICP plant genes present in
transgenic plant N28-220, which is described by Vaeck
et al. (1987), was investigated. To this end, a second
experiment was carried out with batches of the nuclei
of N28-220 described in Example 1.

The nuclei were incubated as described by Cox and
Goldberg (1988) to synthesize radioactively labeled
RNA. The radioactive RNA was extracted as described
previously to provide a probe for a Southern
hybridisation. The Southern blot prepared for this
experiment contained several fragments of the Bt ICP
coding sequence in molar excess relative to the
complementary RNA present in the probe. The rationale
of the experiment was that if the RNA polymerase II
complexes were equally distributed over the Bt ICP
coding region, the hybridisation with the different Bt
ICP DNA fragments present on the Southern blot would be
proportional to the size and dATP content of the
different fragments. A detailed description of the DNA
fragments present on the Southern blot is given in
fig. 2a. The hybridisation of the radioactive RNA
extracted from the nuclei of N28-220 with the Southern
blot revealed that the complete Bt ICP coding sequence
as present in N28-220 is transcribed by RNA
polymerase II.
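
The rationale of this experiment can be illustrated with a small calculation. This is only a sketch: the fragment names and sequences are invented placeholders, and it assumes that label enters the run-on RNA as UTP, so that the signal contributed by a fragment scales with the number of A residues on its template strand.

```python
def expected_signal_shares(template_fragments):
    """Expected share of run-on signal per fragment if polymerases are evenly
    spread, so each template A (one labelled U in the RNA) counts equally."""
    a_counts = {name: seq.upper().count("A")
                for name, seq in template_fragments.items()}
    total = sum(a_counts.values())
    return {name: count / total for name, count in a_counts.items()}


# Hypothetical template-strand fragments (placeholders, not bt884 sequence).
fragments = {
    "frag_5prime": "ATTAGCATTA",        # 4 A residues
    "frag_middle": "GGCATCAGGCAATTCA",  # 5 A residues
    "frag_3prime": "TTAACG",            # 2 A residues
}
shares = expected_signal_shares(fragments)
for name, share in shares.items():
    print(f"{name}: {share:.3f}")
```

Observed signals that fall progressively below such expected shares toward the 3' end would point to an uneven distribution of polymerases.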

Quantification of the hybridisation signals by
densitometric scanning of the autoradiogram showed that
more radioactively labeled RNA was hybridising with DNA
fragments representing Bt ICP sequences located 5' on
the Bt ICP coding sequence than with Bt ICP sequences
located 3' on the Bt ICP coding sequence. The actual
values are given in fig. 2a. This in vitro experiment
demonstrates that in vivo the RNA polymerases are not
evenly distributed over the Bt ICP coding sequence.

The site(s) involved in reducing the RNA 
polymerase II elongation were then determined more 
accurately. Nine M13 derivatives were made that carry
overlapping fragments of the bt2 coding sequence
spanning the region from the AUG to 1584 nucleotides
downstream. The inserts were oriented into the vector
such that, in single stranded M13 derivatives, the Bt
sequences were complementary to the Bt transcript. A
schematic view of the M13 clones is given in fig. 2b.

A molar excess of each single stranded anti-Bt DNA 
was bound to nylon filters to serve as a DNA target for 
hybridisation with labeled RNA prepared from nuclear 
run-on assays with N28-220 nuclei as described by Cox 
and Goldberg (1988). Three nuclear run-ons that 
differed only in their time period of incubation were 
carried out simultaneously. The incubation time 
determines the length of extension of the nascent mRNA 
chain. Shorter incubation periods give a more accurate 
view of the position of the RNA polymerase II complexes 
relative to the substrate DNA and their ability to 
elongate at the moment of the start of incubation.
Hence, the shorter the in vitro incubation period, the
more accurate the view in predicting the in vivo
situation.

The results are shown in fig. 2c. The data for the
5 minute incubation show that, in vivo, at a very
discrete inhibitory zone along the bt2 coding sequence,
one or more factors interfere with transcript
elongation and that such factor(s) remain present in
such inhibitory zone during the course of the in vitro
mRNA extension reaction. Increased incubation periods
show that, on a subset of DNA templates, RNA synthesis
resumed downstream of such inhibitory zone in this
assay without significantly removing the inhibition in
the inhibitory zone itself. In this regard, the data
indicate that:

1. The inhibitory zone causes the RNA polymerases to 
pause and not to terminate. 

2. This pause is only transitory for a small fraction 
of the Bt DNA templates which were used. 

3. The continued RNA polymerase elongation, 
downstream of the inhibitory zone, is done by a 
large number of polymerases on the relatively 
small fraction of the Bt DNA templates. 

It is believed, therefore, that low cytoplasmic Bt
mRNA levels are due at least in part to inefficient
production of precursor mRNA caused by inefficient
elongation of a nascent transcript and/or stalling of
RNA polymerase II complexes at an inhibitory zone.

The inhibitory zone was assayed for its ability to
interact with proteins present in nuclei of tobacco
protoplasts. A crude nuclear extract was prepared from
tobacco SR1 leaf protoplasts according to Luthe and
Quatrano (1980) and used for a filter binding assay
essentially as described by Diffley and Stillman
(1986). 100 ng samples of protein extract were mixed
with different amounts of radioactively labeled 532 bp
XbaI-AccI bt884 DNA fragment, ranging from 0 to 1670
picomolar, in a final volume of 0.150 ml binding buffer
(10 mM Tris pH 7.5, 50 mM NaCl, 1 mM DTT, 1 mM EDTA and
5% glycerol). After 45 minutes incubation at room
temperature, the samples were filtered through an
alkali-washed nitrocellulose membrane and washed twice
with 0.150 ml of an ice-cold solution containing 10 mM
Tris pH 7.5, 50 mM NaCl and 1 mM EDTA. The retention of
DNA-protein complex was quantified by scintillation
counting and revealed that the binding had a
dissociation constant in the 100 picomolar range. The
binding was not affected by preincubation of the
nuclear extract with a molar excess of aspecific
competitor DNA.
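
How a dissociation constant might be read from such a titration can be sketched as follows. The text does not state how the value was derived, so this is an assumption-laden illustration: it supposes a simple one-site saturation model, uses synthetic data generated with a known constant rather than measured counts, and the function name `fit_kd` is hypothetical.

```python
def fit_kd(dna_pm, bound, kd_grid):
    """Least-squares fit of bound = bmax * L / (kd + L) over a grid of kd
    values; for each kd the best bmax follows from a linear regression."""
    best = None
    for kd in kd_grid:
        x = [conc / (kd + conc) for conc in dna_pm]
        sxx = sum(v * v for v in x)
        bmax = sum(v * y for v, y in zip(x, bound)) / sxx
        sse = sum((y - bmax * v) ** 2 for v, y in zip(x, bound))
        if best is None or sse < best[0]:
            best = (sse, kd, bmax)
    return best[1], best[2]


# Synthetic titration over the DNA range used in the text (0 to 1670 pM).
dna_pm = [0, 50, 100, 200, 400, 800, 1670]
true_kd, true_bmax = 100.0, 1.0  # assumed values used to generate the data
bound = [true_bmax * c / (true_kd + c) for c in dna_pm]

kd_est, bmax_est = fit_kd(dna_pm, bound, range(10, 1001, 10))
print(kd_est, round(bmax_est, 3))
```

With real scintillation counts the fit would be noisy, and a finer grid or a proper nonlinear optimiser would be preferable.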

Example 3. Construction of Deletion Mutants

The previous two examples demonstrate that the Bt
ICP coding sequence in a chimaeric plant gene
negatively affects the cytoplasmic Bt ICP mRNA level
directed by the chimaeric plant gene. It is shown that
this negative control is not at the level of
transcription initiation but at least in part due to a
reduced ability of RNA polymerase II to generate
precursor Bt ICP mRNA. A deletion analysis of the
chimaeric Bt ICP plant gene was performed to identify
whether impaired transcription elongation is the
exclusive mechanism by which the Bt ICP sequence
interferes with gene expression. The rationale of the
experiment is that the introduction of specific
deletions in the Bt ICP coding region could remove or
inactivate the sequence element(s) responsible for the
negative control. As a result, such a mutant gene would
direct an increased level of cytoplasmic mRNA. This
method can therefore be used to map and identify the
sequence(s) involved in the negative control.
To perform this analysis, a deletion series of the
bt860-neo gene (Vaeck et al., 1987) was made. Fig. 3
gives a schematic representation. The resultant
deletion derivatives do not specify a Bt ICP and
therefore are assayed at RNA level only. In order to
obtain accurate Bt ICP mRNA concentration values, the
deletion mutants were compared in a transient
expression system using tobacco leaf protoplasts of SR1
(Cornelissen and Vandewiele, 1989). The relative mRNA
abundances were calculated using a correction factor
provided by the mRNA level specified by the cat
reference gene present on the same plasmid as the
mutant Bt ICP gene. Four hours after introduction of
the genes the tobacco leaf protoplasts were harvested,
and total RNA was prepared and analysed (fig. 4).
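
The correction by the cat reference gene can be sketched as a ratio of ratios. This is a minimal illustration, not the calculation actually used for fig. 4: the signal values are invented, and the function name `relative_bt_abundance` is hypothetical.

```python
def relative_bt_abundance(bt_signal, cat_signal, ref_bt_signal, ref_cat_signal):
    """Bt ICP mRNA level of a mutant relative to a reference construct,
    after dividing each by the cat signal measured from the same plasmid."""
    mutant_ratio = bt_signal / cat_signal
    reference_ratio = ref_bt_signal / ref_cat_signal
    return mutant_ratio / reference_ratio


# Hypothetical slot-blot readings in arbitrary units.
parental = {"bt": 1.0, "cat": 10.0}
mutant = {"bt": 3.0, "cat": 15.0}

fold = relative_bt_abundance(mutant["bt"], mutant["cat"],
                             parental["bt"], parental["cat"])
print(f"relative Bt ICP mRNA abundance: {fold:.2f}")
```

Dividing by the cat signal removes differences in DNA uptake between protoplast batches, so that only the effect of the deletion remains.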

Mutants nos. 50-60 (fig. 3) show that
progressive deletions of the carboxy-terminal part or
the amino-terminal part of the Bt ICP coding sequence
result in a gradually increasing neo transcript level.
As there are no very abrupt changes in transcript
levels, these results suggest that the low transcript
level produced by full length Bt ICP genes is
controlled by a number of signals. Deletions within the
Bt ICP coding sequence indeed did not localise a
specific sequence element which, by itself, is
responsible for the low Bt ICP mRNA level. Similarly,
cloning of fragments of the Bt ICP coding sequence in
pJD50 (fig. 3) did not allow identification of such a
region.

The relative transcript levels were plotted
against the length of the Bt ICP sequence present in
the different deletion derivatives. Fig. 5 suggests
that hybrid Bt ICP-neo transcript levels drop with
increasing length of the Bt ICP sequence. In this
respect, the mutants nos. 61-66 (fig. 3) form an
exception as they show on average a low transcript
level relative to the length of the Bt ICP sequence.

These results show that the low transcript levels 
of Bt ICP plant genes in tobacco are not exclusively 
due to an impaired elongation of the nascent transcript 
but that a number of signals operate to cause a reduced 
expression capacity of the chimaeric Bt ICP gene. 

Example 4. 

To determine whether cytoplasmic events are 
important in causing inefficient expression of the bt2 
gene in plants, the following test was carried out. 
Cytoplasmic bt2 mRNA steady state levels in transgenic
leaf protoplasts of N28-220 are normally found to be
below 1 transcript per cell. The steady state level is
determined by, and is proportional to, the number of
bt2 transcripts entering the cytoplasm per unit time
and the cytoplasmic half-life of the transcript. When
steady state levels are achieved, the absolute numbers
of transcripts entering and leaving the cytoplasmic bt2
mRNA pool are equal. Therefore, the cytoplasmic
half-life and cytoplasmic steady state level of the bt2
transcript will reveal whether its cytoplasmic steady
state level is due to a relatively low import of bt2
transcript, a relatively high turnover (i.e.,
conversion to a protein) rate, or a combination of
both.

The cytoplasmic turnover of bt884 transcripts was
determined according to Gallie et al. (1989). A capped
and polyadenylated synthetic bt884 mRNA was produced in
vitro according to protocols of Promega Corporation
(Madison, Wisconsin, USA) and introduced into tobacco
leaf protoplasts simultaneously with a synthetic bar
(De Block et al., 1987) mRNA. The two synthetic
transcripts differed only in their coding sequences. At
various times after RNA delivery, samples were taken,
and total RNA was isolated. Northern analyses revealed
that the half-lives (T1/2) of the synthetic bt884 and
bar transcripts were about 8 +/- 3 hours and 5 +/- 2
hours, respectively. See Table 1, below. These data
show that the bt884 coding sequence, more particularly
the bt884 codon usage and the AU-rich motifs in the
bt884 coding sequence, do not render the bt884 mRNA
more unstable than the bar mRNA, which is known to
accumulate in the cytoplasm to about 1000 transcripts
per tobacco leaf protoplast (calculated from
Cornelissen, 1989). The low cytoplasmic steady state
level of the bt884 transcripts is, therefore, caused by
a lack of import of transcripts into the cytoplasm.
Thus, the expression defect of the bt884 gene has to be
restored by introduction of modifications in the bt884
coding sequence that improve the expression pathway in
the nucleus.
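
The reasoning from half-life and steady state level to import rate can be made explicit with a short calculation. This is a sketch under a stated assumption, namely first-order decay of the cytoplasmic pool, which the text does not spell out; the inputs are the approximate figures quoted above (about 1000 bar transcripts with a half-life of about 5 hours, and fewer than 1 Bt transcript with a half-life of about 8 hours), and the function name is hypothetical.

```python
import math


def influx_per_hour(steady_state_copies, half_life_h):
    """Entry rate needed to hold a pool at steady state under first-order
    decay: influx = copies * ln(2) / half-life."""
    decay_constant = math.log(2) / half_life_h
    return steady_state_copies * decay_constant


bar_influx = influx_per_hour(1000, 5)     # about 1000 copies, T1/2 about 5 h
bt884_influx_max = influx_per_hour(1, 8)  # upper bound: below 1 copy, T1/2 about 8 h
ratio = bar_influx / bt884_influx_max

print(f"bar influx: {bar_influx:.1f} transcripts per hour")
print(f"bt884 influx: below {bt884_influx_max:.3f} transcripts per hour")
print(f"bar import exceeds bt884 import by at least {ratio:.0f}-fold")
```

Because the two half-lives are of the same order, nearly all of the difference in steady state level has to come from the import term, which is the conclusion drawn in the text.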

Expression of the bt14, bt15 and bt18 genes in
tobacco revealed that these genes also direct low
cytoplasmic mRNA steady state levels. Therefore, a
similar analysis was carried out with synthetic bt14,
bt15 and bt18 transcripts to identify whether the
expression defect had a cytoplasmic or nuclear
character. Table 1, below, shows that all three
transcripts behave as stable mRNAs in the cytoplasm of
tobacco leaf protoplasts. Therefore, bt14, bt15 and
bt18 genes, like the bt884 gene, must be deficient in
exporting high levels of bt transcript to the
cytoplasm, and to improve the expression of such genes,
it is necessary to modify their coding sequences so
that nuclear events, which interfere with efficient
gene expression, are avoided or ameliorated.

Table 1

Half-life determination of synthetic bt and bar mRNAs
in Nicotiana tabacum cv. Petit Havana SR1 leaf
protoplasts

Example   1st mRNA   T1/2 (hours)   2nd mRNA   T1/2 (hours)
A         bt884      8 +/- 3        bar        5 +/- 2
B         bt14       7 +/- 2        bar        6 +/- 3
C         bt15       12 +/- 5       bar        21 +/- 12
D         bt18       10 +/- 5       bar        12 +/- 5

Legend


The synthetic bar transcripts had a length of 783 bases
and included a cap, the TMV leader (77 bases, Danthinne
and Van Emmelo, 1990), the bar coding sequence (552
bases; De Block et al., 1987), a trailer of 52
nucleotides consisting of the bases GAUCA CGCGA AUU and
39 bases from the pGEM-3Z (Promega) polylinker (KpnI
(T4 DNA pol.)-HindIII (T4 DNA pol.)), and a poly(A) of
the composition (A)33G(A)32G(A)32, followed by the
nucleotides GCU.

The synthetic bt884 transcripts had a length of 2066
bases and included a cap, the TMV leader (77 bases),
the bt884 coding sequence followed by the trailer until
the Klenow-treated PstI site (1843 nucleotides), the
trailer continued with AAUUC CGGGG AUCAA UU, 39 bases
of the pGEM-3Z polylinker and the (A)33G(A)32G(A)21
poly(A), followed by the nucleotides CG.

The synthetic bt14 transcripts had a length of 2289
bases and included a cap, the TMV leader (77 bases),
the bt14 coding sequence till the Klenow-treated BclI
site (2023 bases), plus 26 supplementary nucleotides CG
UCG ACC UGC AGC CAA GCU UGC UGA, a trailer starting
with UUGAU UGACC GGAUC CGGCU CUAGA AUU, followed by 39
bases of the pGEM-3Z polylinker, and the
(A)33G(A)32G(A)21 poly(A), followed by the nucleotides
CGGUA CCC.

The synthetic bt15 transcripts had a length of 2198
bases and included a cap, the TMV leader (77 bases), the
bt15 coding sequence as in pVE35 (PCT publication
WO90/15139) followed by the trailer till the
Klenow-treated BamHI site (1989 bases), the trailer then
continued with AAUU, 39 bases of the pGEM-3Z polylinker
and the (A)33G(A)32G(A)# poly(A), followed by the
nucleotides CG.

The synthetic bt18 transcripts had a length of 2184
bases and included a cap, the TMV leader (77 bases), the
bt18 coding sequence until the Klenow-treated BclI site
(1918 bases), followed by 26 nucleotides until the
translation stop, CG UCG ACC UGC AGC CAA GCU UGC UGA, a
trailer starting with UUGAU UGACC GGAUC GAUCC GGCUC
AGAUC AAUU, 39 bases of the pGEM-3Z polylinker and the
(A)33G(A)32G(A)21 poly(A), followed by the nucleotides
CG.
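
The transcript lengths given in this legend can be cross-checked by adding up the listed components. The sketch below does only that arithmetic. Two readings are assumptions rather than facts from a clean source: the bar poly(A) is taken as (A)33G(A)32G(A)32 followed by GCU, and the last A-run of the bt15 poly(A), which is illegible in the text, is left as an unknown whose length is inferred from the stated total.

```python
def n(seq):
    """Number of nucleotides in a sequence written with spaces."""
    return len(seq.replace(" ", ""))


def poly_a(*runs):
    """Length of a poly(A) made of A-runs separated by single G residues."""
    return sum(runs) + (len(runs) - 1)


LEADER = 77      # TMV leader
POLYLINKER = 39  # pGEM-3Z polylinker bases

# name: (stated length, list of component lengths)
transcripts = {
    "bar": (783, [LEADER, 552, n("GAUCA CGCGA AUU"), POLYLINKER,
                  poly_a(33, 32, 32), n("GCU")]),
    "bt884": (2066, [LEADER, 1843, n("AAUUC CGGGG AUCAA UU"), POLYLINKER,
                     poly_a(33, 32, 21), n("CG")]),
    "bt14": (2289, [LEADER, 2023, 26, n("UUGAU UGACC GGAUC CGGCU CUAGA AUU"),
                    POLYLINKER, poly_a(33, 32, 21), n("CGGUA CCC")]),
    "bt18": (2184, [LEADER, 1918, 26,
                    n("UUGAU UGACC GGAUC GAUCC GGCUC AGAUC AAUU"),
                    POLYLINKER, poly_a(33, 32, 21), n("CG")]),
}

totals = {name: sum(parts) for name, (_, parts) in transcripts.items()}
for name, (stated, _) in transcripts.items():
    print(name, "stated", stated, "summed", totals[name])

# bt15: everything except the final A-run, whose length is not legible.
bt15_known = LEADER + 1989 + n("AAUU") + POLYLINKER + (33 + 1 + 32 + 1) + n("CG")
bt15_implied_last_run = 2198 - bt15_known
print("bt15 final A-run implied by the stated length:", bt15_implied_last_run)
```

The four fully specified transcripts add up to their stated lengths, which supports the readings used here. For bt15 the stated length would imply a final run of 20 A residues, but that is an inference and not a value taken from the text.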

Example 5. Construction of Modified Bt ICP Genes 
Examples 1-4 show that the expression in a plant
of a Bt ICP gene is negatively affected by the Bt ICP
coding sequence at both transcriptional and
post-transcriptional levels, but principally by nuclear
events. These examples also show that the control of
expression is not confined to a specific DNA sequence
within the Bt ICP coding sequence. Instead, the
negative effect on gene expression is an intrinsic
property of the Bt ICP coding sequence. On this basis,
it is believed that, by directed change of the DNA 
sequence of the Bt ICP coding region, an improvement of 
gene expression will occur. The improvement will be of 
a cumulative type as the negative influence of the Bt 
ICP coding region is spread over the complete coding 
sequence. Similarly, an improvement of gene expression 
will be obtained by reduction of the length of the Bt 
ICP coding sequence. This improvement will have a 
cumulative effect if used in combination with 
modifications of the Bt ICP coding region. 

Therefore, two types of modifications were 
introduced into a Bt ICP (i.e., bt2) coding sequence 
which, as will be shown, indeed resulted in a 
significant increase in Bt ICP plant gene expression. 
First, the DNA sequence was modified in the central 
region of the toxic core fragment of the Bt ICP as 
transcription elongation is impaired in this region. 
Secondly, the length of the Bt ICP coding sequence was 
reduced as the negative influence is proportional to 
the length of the Bt ICP coding sequence. A detailed 
description of the mutations is given in figs. 6a, b 
and c. As shown in fig. 7, the modifications change the 
AT-content of the chimaeric Bt ICP gene significantly. 
The modifications change the primary DNA structure of 
the Bt ICP coding sequence without affecting the amino 
acid sequence of the encoded protein. It is evident 
that, if more DNA mutations were to be introduced into 
the Bt ICP coding sequence, a further improvement of 
gene expression would be obtained. 
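
The principle of changing the DNA sequence and its AT-content while leaving the encoded protein untouched can be shown on a toy example. The two 12-nucleotide sequences below are invented for illustration and are not taken from the bt2 gene or from figs. 6 and 7; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a coding sequence codon by codon (one-letter amino acids)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def at_content(dna):
    """Fraction of A and T residues in a DNA sequence."""
    return (dna.count("A") + dna.count("T")) / len(dna)


original = "AAATTAGCTATT"  # hypothetical AT-rich stretch: Lys-Leu-Ala-Ile
modified = "AAGCTCGCCATC"  # synonymous codons with fewer A and T residues

print(translate(original), f"{at_content(original):.2f}")
print(translate(modified), f"{at_content(modified):.2f}")
```

Both sequences translate to the same four amino acids, while the AT-content drops from 10 of 12 positions to 5 of 12.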

To determine the effect of the modifications, the
expression properties of the modified Bt ICP gene and
the parental bt860-neo gene were compared in a
transient expression system as described by Cornelissen
and Vandewiele (1989) and Denecke et al. (1989).
Basically, the accumulation profiles of the genes under
study were compared by relating their profiles to the
profile of a reference gene present in the same
experiment. Fig. 8a shows the vectors used in the
assay, and fig. 8b shows that the accumulation of the
reference CAT protein is nearly identical in both
experiments. It is not possible to measure the
accumulation of Bt ICP encoded by the parental
bt860-neo gene, but the modified Bt ICP gene clearly
directs an increased synthesis of Bt ICP.

These results demonstrate that mutation of the Bt
ICP coding sequence relieves the negative influence of
the Bt ICP coding sequence on the expression of a Bt
ICP plant gene.

Example 6. Cloning and Expression of Modified Bt ICP
Genes in Tobacco and Potato Plants

Using the procedures described in US patent
application 821,582, filed January 22, 1986, and EPA
86300291.1, EPA 88402115.5 and EPA 89400428.2, the
modified Bt ICP (i.e., bt2) genes of figs. 6 and 7 are
inserted into the intermediate T-DNA vector, pGSH1160
(Deblaere et al., 1988) between the vector's T-DNA
terminal border repeat sequences.

To obtain significant expression in plants, the
modified Bt ICP genes are placed under the control of
the strong TR2' promoter (Velten et al., 1984) and are
fused to the transcript 3' end formation and
polyadenylation signals of the T-DNA gene 7 (Velten and
Schell, 1985).

In addition, the translation initiation context or
site is changed in accordance with the Joshi consensus
sequence (Joshi, 1987) in order to optimize the
translation initiation in plant cells. To this end, an
oligo duplex (figs. 6a and 6b) is introduced to create
the following sequence at the translation initiation
site: AAAACCATGGCT. In this way, an additional codon
(i.e., GCT) coding for alanine is introduced.
Additionally, KpnI and BstXI sites are created upstream
of the ATG translation initiation codon.
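
The layout of the engineered initiation site can be read off directly from the sequence given above. The following lines are a small illustrative check, with variable names chosen here for convenience, that locates the ATG and confirms that the codon following it is one of the four alanine codons.

```python
ALANINE_CODONS = {"GCT", "GCC", "GCA", "GCG"}  # standard genetic code

site = "AAAACCATGGCT"                      # sequence quoted in the text
start = site.index("ATG")                  # position of the initiation codon
upstream = site[:start]                    # context preceding the ATG
second_codon = site[start + 3:start + 6]   # codon directly after the ATG

print("upstream context:", upstream)
print("second codon:", second_codon,
      "(alanine)" if second_codon in ALANINE_CODONS else "")
```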

Using standard procedures (Deblaere et al., 1985),
the intermediate plant expression vectors, containing
the modified Bt ICP gene, are transferred into the
Agrobacterium strain C58C1 Rif^R (US patent application
821,582; EPA 86300291.1) carrying the disarmed
Ti-plasmid pGV2260 (Vaeck et al., 1987). Selection for
spectinomycin resistance yields cointegrated plasmids,
consisting of pGV2260 and the respective intermediate
plant expression vectors. Each of these recombinant
Agrobacterium strains is then used to transform
different tobacco plant cells (Nicotiana tabacum) and
potato plant cells (Solanum tuberosum) so that the
modified Bt ICP genes are contained in, and expressed
by, different tobacco and potato plant cells.

The transgenic tobacco plants containing the 
modified Bt ICP genes are analyzed with an ELISA assay. 
These plants are characterized by a significant 
increase in levels of Bt (Bt2) proteins, compared to a 
transgenic tobacco plant containing a non-modified Bt 
ICP (bt2) gene. 

The insecticidal activity of the expression
products of the modified Bt ICP (bt2) genes in leaves
of transformed tobacco and potato plants is evaluated
by recording the growth rate and mortality of larvae of
tobacco hornworm (Manduca sexta), tobacco budworm
(Heliothis virescens) and potato tubermoth (Phthorimaea
operculella) fed on leaves of these two types of
plants. These results are compared with the growth rate
of larvae fed leaves from tobacco and potato plants
transformed with the unmodified or parental Bt ICP
(bt2) gene and from untransformed potato and tobacco
plants. Toxicity assays are performed as described in
EPA 88402115.5 and EPA 86300291.1.

A significantly higher mortality rate is obtained 
among larvae fed on leaves of transformed plants 
containing and expressing the modified Bt ICP genes. 
Tobacco and potato plants containing the modified Bt
ICP genes show considerably higher expression levels of
Bt ICPs compared to tobacco and potato plants
containing the unmodified Bt ICP gene.

The insecticidal activity of three transgenic
tobacco plants containing the modified Bt ICP genes is
determined against second and third instar larvae of
Heliothis virescens. The control plant is not
transformed. The results are summarized in Table 2,
below.

Table 2

Plant      % mortality of insects (recorded after 5 days)

Control    11
No. 1      100
No. 2      88.5
No. 3      100

Needless to say, this invention is not limited to
tobacco and potato plants transformed with the modified
Bt ICP gene. It includes any plant, such as tomato,
alfalfa, sunflowers, corn, cotton, soybean, sugar
beets, rapeseed, brassicas and other vegetables,
transformed with the modified Bt ICP gene.

Nor is the invention limited to the use of
Agrobacterium tumefaciens Ti-plasmids for transforming
plant cells with a modified Bt ICP gene. Other known
techniques for plant transformation, such as by means
of liposomes, by electroporation or by vector systems
based on plant viruses or pollen, can be used for
transforming monocotyledons and dicotyledons with
such a modified Bt ICP gene.

Nor is the invention limited to the bt2 gene, but
rather encompasses all CryI, CryII, CryIII and CryIV
Bt ICP genes.

Claims

1. A process for modifying a Bt ICP gene to improve
its expression in a plant cell, transformed with the
gene; the process comprising the step of: changing A
and T sequences in a plurality of translational codons
of the gene to corresponding G and C sequences encoding
the same amino acids, so as to improve the gene's
transcription to an mRNA, the nuclear accumulation of
the mRNA and/or the nuclear export of the mRNA,
particularly the gene's transcription, in the plant
cell.

2. The process of claim 1 for modifying a Bt ICP gene
to improve its transcription in plant cells,
transformed with the modified gene, wherein the
plurality of translational codons is at least one
region of the gene which, during transcription, has
thereon a relatively low percentage of RNA polymerase
II of the plant cell as compared to another adjacent
upstream region of the gene.

3. The process of claim 1 or 2, wherein the Bt ICP
gene encodes a Bt insecticidal crystal protein
truncated towards a trypsin cleavage site, preferably
at both its C-terminal and N-terminal ends, and
preferably encoding a portion of the protein of about
60 - 80 kDa, particularly the toxin of the protein.

4. The process of any one of claims 1-3, wherein A and
T sequences of at least 3 codons are changed to G and C
sequences at a translation initiation site of the gene
and A and T sequences of at least about 3, preferably
at least about 10, especially at least about 33, codons
are changed to G and C sequences in a second region of
the gene, preferably affecting transcription
elongation, downstream of the translation initiation
site.

5. The process of any one of claims 1-4, wherein A and
T sequences of at least about 3, preferably at least
about 10, especially at least about 33, codons are
changed to G and C sequences in a third region of the
gene, preferably affecting cytoplasmic RNA
concentration.

6. The process of claim 4 or 5, wherein A and T
sequences of at least about 3 codons are changed to G
and C sequences at a translation termination end of the
gene.

7. The process of any one of claims 4-6, wherein the
gene is a cryI gene, such as a bt2, bt14, bt15 or bt18
gene, preferably a bt2 gene, or a gene having
substantial sequence homology thereto.

8. The process of claim 7 wherein the gene is a bt2
gene; the second region being between about nucleotides
674 and 1000 and A and T sequences of about 59 or more
codons are changed to G and C sequences in the second
region, particularly between about nucleotides 733 and
1000, quite particularly between about nucleotides 765
and 794.

9. The process of any one of claims 1-8, wherein the
gene is further modified by substituting for its ATG
translation initiation site: AAAACCATGGCT.

10. The modified Bt ICP gene obtained by the process
of any one of claims 1-9.

11. A chimaeric gene for transforming a cell of a
plant, comprising the following operably-linked DNA
fragments in the same transcriptional unit:

a) the modified Bt ICP gene of claim 10;

b) a promoter capable of directing expression of
the modified Bt ICP gene in the plant cell; and

c) transcript 3' end formation and polyadenylation
signals suitable for expressing the modified Bt
ICP gene in the plant cell.

12. The plant cell of claim n, transformed with the 
chimaeric gene of claim 11. 

13. A plant, plant tissue or plant cell culture 
consisting of the plant cells of claim 12. 

14. A seed of the plant of claim 13. 

15. A vector, preferably a Ti-plasmid, for stably 
transforming the nuclear genome of a plant, comprising 
the chimaeric gene of claim 11. 

16. A process for protecting the plant of claim 10
against an insect pest, comprising the step of:
transforming the genome of the plant with the chimaeric
gene of claim 11.

17. A process for modifying a foreign gene whose rate
and/or level of expression in a plant cell, transformed
with the gene, is substantially limited by the rate
and/or level of nuclear production of an mRNA encoded
by the gene; the process comprising the step of:
changing A and T sequences in a plurality of
translational codons of the gene, particularly in a
plurality of translational codons in at least one
region of the gene which, during transcription, has
thereon a relatively low percentage of RNA polymerase
II of the plant cell as compared to another adjacent
upstream region of the gene; the A and T sequences
being changed to corresponding G and C sequences
encoding the same amino acids, so as to improve the
gene's transcription to the mRNA, the nuclear
accumulation of the mRNA and/or the nuclear export of
the mRNA, particularly the gene's transcription to the
mRNA, quite particularly the transcript elongation by
RNA polymerase II on the gene, in the plant cell.
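
The substitution recited in claims 1 and 17 (A and T changed to G and C while the encoded amino acids stay the same) can be illustrated on a short stretch of the bt2 gene. The following Python sketch is an illustration only and not part of the original disclosure. NATIVE and MODIFIED are a 12-codon stretch as read from the scanned native and synthetic sequences in the figures, so both transcriptions are assumptions; the function names are hypothetical, and the codon table is the standard genetic code.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Twelve codons as read from the figures (assumed transcriptions).
NATIVE = "AGAGATTGGATAAGATATAATCAATTTAGAAGAGAA"
MODIFIED = "AGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAG"


def translate(seq):
    """Translate complete codons of seq, starting at its first base."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def base_changes(native, modified):
    """List (position, old_base, new_base) for every differing position."""
    return [(i, a, b) for i, (a, b) in enumerate(zip(native, modified)) if a != b]


def is_synonymous_gc_change(native, modified):
    """True if every change is A/T to G/C and the encoded protein is unchanged."""
    if len(native) != len(modified) or translate(native) != translate(modified):
        return False
    return all(a in "AT" and b in "GC" for _, a, b in base_changes(native, modified))


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


print(translate(NATIVE), translate(MODIFIED))
print(len(base_changes(NATIVE, MODIFIED)), gc_fraction(NATIVE), gc_fraction(MODIFIED))
print(is_synonymous_gc_change(NATIVE, MODIFIED))
```

The check only verifies that a given pair of sequences is consistent with the claimed kind of change. It does not select which codons to change, which the claims tie to regions of the gene identified experimentally.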
Figure 2c

OLIGOPS15

GTACCAAAACCATGGCTATCGAGACCGGTTACACCCCAATCGATATCG 



OLIGOPS16 

ATCGATTGGGGTGTAACCGGTCTCGATAGCCATGGTTTTGGTACCGAT 
Figure 6a(cont.)

Linear LENGTH = 52

OLIGOPS15  (>): positions 5 - 52
OLIGOPS16' (<): positions 1 - 48

3) OLIGOPS16', 4) OLIGOPS15

Name Base

3      1  ATCGGTACCA AAACCATGGC TATCGAGACC GGTTACACCC CAATCGAT
4      1      GTACCA AAACCATGGC TATCGAGACC GGTTACACCC CAATCGATAT CG
CON    1  ATCGGTACCA AAACCATGGC TATCGAGACC GGTTACACCC CAATCGATAT CG
Figure 6a(cont.)

ATC GGT ACC AAA ACC ATG GCT ATC GAG ACC GGT TAC ACC CCA ATC GAT ATC G
                    MET Ala Ile Glu Thr Gly Tyr Thr Pro Ile Asp Ile
OLIGO1

GATCCTCTAGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTAACCCTAACCGTGTTAGAC

OLIGO2

ATCGTGTCCCTATTCCCGAACTACGACAGCAGGACGTACCCAATCCGAACCGTGTCCCAGTTAACCAGGGA

OLIGO3

GATCTACACCAACCCAGTGTTAGAGAACTTCGACGGTAGCTTCCGAGGCTCGGCTCAGGGCATCG

OLIGO4

AGGGAAGCATCAGGAGCCCACACTTGATGGACATCCTTAACAGCATCACCATCTACACGGACGCT

OLIGO5

CACAGGGGAGAGTACTACTGGTCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCGGGGCCAGAATTCG

OLIGO6

GATCCGAATTCTGGCCCCGAGAACCCCACAGGGGAAGCCATGATCTGGTGCCCGGACCAGTAGTAC

OLIGO7

TCTCCCCTGTGAGCGTCCGTGTAGATGGTGATGCTGTTAAGGATGTCCATCAAGTGTGGGCTCCT

OLIGO8

GATGCTTCCCTCGATGCCCTGAGCCGAGCCTCGGAAGCTACCGTCGAAGTTCTCTAACACTGGG

OLIGO9

TTGGTGTAGATCTCCCTGGTTAACTGGGACACGGTTCGGATTGGGTACGTCCTGCTGTCGTAGTTCGGGAA

OLIGO10

TAGGGACACGATGTCTAACACGGTTAGGGTTAACTCCCTCCTGAACTGGTTGTACCTGATCCAGTCTCTAGAG
synthe-9 Linear LENGTH = 343

Upper strand (>): OLIGO1 1 - 65, OLIGO2 66 - 136, OLIGO3 137 - 201,
                  OLIGO4 202 - 266, OLIGO5 267 - 339
Lower strand (<): OLIGO10' 5 - 77, OLIGO9' 78 - 148, OLIGO8' 149 - 212,
                  OLIGO7' 213 - 277, OLIGO6' 278 - 343

Name Base

1      1  GATCCTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT
10     1      CTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT
CON    1  GATCCTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT

1     61  TAGAC
10    57  TAGACATCGT GTCCCTA
2      1       ATCGT GTCCCTATTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT
9      1                    TTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT
CON   61  TAGACATCGT GTCCCTATTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT

2     56  CCCAGTTAAC CAGGGA
9     44  CCCAGTTAAC CAGGGAGATC TACACCAA
3      1                   GATC TACACCAACC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC
8      1                                CC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC
CON  121  CCCAGTTAAC CAGGGAGATC TACACCAACC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC

3     45  GAGGCTCGGC TCAGGGCATC G
8     33  GAGGCTCGGC TCAGGGCATC GAGGGAAGCA TC
4      1                         AGGGAAGCA TCAGGAGCCC ACACTTGATG GACATCCTTA
7      1                                     AGGAGCCC ACACTTGATG GACATCCTTA
CON  181  GAGGCTCGGC TCAGGGCATC GAGGGAAGCA TCAGGAGCCC ACACTTGATG GACATCCTTA

4     40  ACAGCATCAC CATCTACACG GACGCT
7     29  ACAGCATCAC CATCTACACG GACGCTCACA GGGGAGA
5      1                              CACA GGGGAGAGTA CTACTGGTCC GGGCACCAGA
6      1                                          GTA CTACTGGTCC GGGCACCAGA
CON  241  ACAGCATCAC CATCTACACG GACGCTCACA GGGGAGAGTA CTACTGGTCC GGGCACCAGA

5     35  TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCG
6     24  TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCGG ATC
CON  301  TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCGG ATC
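
The way the ten oligonucleotides tile into the 343-nucleotide fragment can be checked with a short script. The following Python sketch is illustrative and not part of the original disclosure. The oligonucleotide sequences are typed from the scanned figure, with characters the scan rendered ambiguously resolved against the alignment, so they should be treated as assumed transcriptions; the function names and the offset parameter are hypothetical.

```python
# Upper-strand oligonucleotides, 5' to 3' (assumed transcriptions).
TOP = [
    "GATCCTCTAGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTAACCCTAACCGTGTTAGAC",          # OLIGO1
    "ATCGTGTCCCTATTCCCGAACTACGACAGCAGGACGTACCCAATCCGAACCGTGTCCCAGTTAACCAGGGA",    # OLIGO2
    "GATCTACACCAACCCAGTGTTAGAGAACTTCGACGGTAGCTTCCGAGGCTCGGCTCAGGGCATCG",          # OLIGO3
    "AGGGAAGCATCAGGAGCCCACACTTGATGGACATCCTTAACAGCATCACCATCTACACGGACGCT",          # OLIGO4
    "CACAGGGGAGAGTACTACTGGTCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCGGGGCCAGAATTCG",  # OLIGO5
]

# Lower-strand oligonucleotides, 5' to 3' (assumed transcriptions).
BOTTOM = [
    "GATCCGAATTCTGGCCCCGAGAACCCCACAGGGGAAGCCATGATCTGGTGCCCGGACCAGTAGTAC",         # OLIGO6
    "TCTCCCCTGTGAGCGTCCGTGTAGATGGTGATGCTGTTAAGGATGTCCATCAAGTGTGGGCTCCT",          # OLIGO7
    "GATGCTTCCCTCGATGCCCTGAGCCGAGCCTCGGAAGCTACCGTCGAAGTTCTCTAACACTGGG",           # OLIGO8
    "TTGGTGTAGATCTCCCTGGTTAACTGGGACACGGTTCGGATTGGGTACGTCCTGCTGTCGTAGTTCGGGAA",    # OLIGO9
    "TAGGGACACGATGTCTAACACGGTTAGGGTTAACTCCCTCCTGAACTGGTTGTACCTGATCCAGTCTCTAGAG",  # OLIGO10
]

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string containing only A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def assemble(top_oligos, bottom_oligos, offset):
    """Join both strands and return the consensus of the duplex.

    offset is the number of upper-strand bases left unpaired at the 5' end.
    Raises ValueError if the strands disagree anywhere in the paired region.
    """
    top = "".join(top_oligos)
    bottom = reverse_complement("".join(bottom_oligos))
    paired = len(top) - offset
    if top[offset:] != bottom[:paired]:
        raise ValueError("strands are not complementary in the paired region")
    return top + bottom[paired:]


consensus = assemble(TOP, BOTTOM, offset=4)
print(len(consensus), consensus[:4], consensus[-4:])
```

With an offset of four bases the two strands agree over the whole paired region, leaving a four-base single-stranded extension at each end of the fragment.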
GAT CCT CTA GAG ACT GGA TCA GGT ACA ACC AGT TCA GGA GGG AGT TAA CCC TAA
  Ser Ser Arg Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg Glu Leu Thr Leu Thr

                                 81                                 108
CCG TGT TAG ACA TCG TGT CCC TAT TCC CGA ACT ACG ACA GCA GGA CGT ACC CAA
  Val Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro Ile

                                135                                 162
TCC GAA CCG TGT CCC AGT TAA CCA GGG AGA TCT ACA CCA ACC CAG TGT TAG AGA
  Arg Thr Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr Asn Pro Val Leu Glu Asn

                                189                                 216
ACT TCG ACG GTA GCT TCC GAG GCT CGG CTC AGG GCA TCG AGG GAA GCA TCA GGA
  Phe Asp Gly Ser Phe Arg Gly Ser Ala Gln Gly Ile Glu Gly Ser Ile Arg Ser

                                243                                 270
GCC CAC ACT TGA TGG ACA TCC TTA ACA GCA TCA CCA TCT ACA CGG ACG CTC ACA
  Pro His Leu MET Asp Ile Leu Asn Ser Ile Thr Ile Tyr Thr Asp Ala His Arg

                                297                                 324
GGG GAG AGT ACT ACT GGT CCG GGC ACC AGA TCA TGG CTT CCC CTG TGG GGT TCT
  Gly Glu Tyr Tyr Trp Ser Gly His Gln Ile MET Ala Ser Pro Val Gly Phe Ser

CGG GGC CAG AAT TCG GAT C
  Gly Pro Glu Phe Gly
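
The amino acids listed under the synthetic fragment are read in the third frame of the sequence as printed, that is, starting at its third nucleotide. The following Python sketch, which is illustrative and not part of the original disclosure, translates all three frames with the standard genetic code and reports which of them is free of stop codons. SYNTHE9 is the 343-nucleotide consensus as read from the scanned alignment (an assumed transcription); the function and variable names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Consensus of the synthetic fragment (assumed transcription of the figure).
SYNTHE9 = (
    "GATCCTCTAGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTAACCCTAACCGTGT"
    "TAGACATCGTGTCCCTATTCCCGAACTACGACAGCAGGACGTACCCAATCCGAACCGTGT"
    "CCCAGTTAACCAGGGAGATCTACACCAACCCAGTGTTAGAGAACTTCGACGGTAGCTTCC"
    "GAGGCTCGGCTCAGGGCATCGAGGGAAGCATCAGGAGCCCACACTTGATGGACATCCTTA"
    "ACAGCATCACCATCTACACGGACGCTCACAGGGGAGAGTACTACTGGTCCGGGCACCAGA"
    "TCATGGCTTCCCCTGTGGGGTTCTCGGGGCCAGAATTCGGATC"
)


def translate(seq, offset=0):
    """Translate complete codons of seq, starting at the 0-based offset."""
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# A frame is "open" here if its translation contains no stop codon ("*").
open_frames = [frame for frame in range(3) if "*" not in translate(SYNTHE9, frame)]
protein = translate(SYNTHE9, offset=2)

print(open_frames)
print(protein)
```

Printing the protein in one-letter code allows a direct comparison with the three-letter residues listed under the sequence.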
Figure 6c

PVE36.SEQ nucleotide 800 - 4000

AAATGGATAAATAGCCTTGCTTCCTATTATATCTTCCCAAATTACCAATACATTACACTAGCATCTGAAT

TTCATAACCAATCTCGATACACCAAATCX3MG0ATCCCGATAACAATCCGAACATCAATGAATGCATTCC 

TTATV^TTGTTTAAGTAACCCTGAAGTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCA 

ATCGATATTTCCTTGTCGCTAACX5CAATTTCTTTTGAGTGAATTTGTTCCCGGTGCT 

GACrAGTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATTGAACA 

GTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTAGAAG 

CTTTAT CAAATTT ACGCAGAATCTTTTAGAGAGTGGGAAG C AGAT CCT ACT AAT CC AG C ATT A AG AG A AG 

AGATGCGTATTCAATTCAATGACATGAACAGTG(XCOTACAA 

TTATCAAGTTCCTCTTTTATCAGTATATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTT 
TCAGWm'TGGACAAAGGTGGGGATTTGATGreGCGACTATCA 

TTATTGGCyVACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTAT 

TTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGA ATTAAC ACT AACTGT ATT AGAT AT CGTTTCT 

CTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTTTCCCAATTAACAAGAGAAATTTATA 

CAAACCCAGTATTAGAAAATTTTGATGGTAGTTTTCGAGGCTCGGCTCAGGGCATAGAA^ 

GAGTCCACATTTGATGGATATACTTAACAGTATAACCATCTATACGGATGCTCATAGAGGAGAATATTAT 

TGGT^GGGCATCAAATAATGGCTTCTCCTGTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATG 

GAACTATGGGAAATGCAGCTCCACAACAACGTATTGTTGCrCAACTAGCT 

ATCGTCCACPTPATATAGAAGACCTTTTAATATAGGGATAAATAATCAACAACTATCTGTTCTTGACGGG 

ACAGAATTTGCTTATGGAACXTCCTCAAATTTGCCATCXX5CTGTATACAGAAAAAGCGGAA 

CX3CTGGATGAAATACCGCCACAGAATAACAACGTGCCACCTAGGCAAGGAOT 

TGTTTCAATGTTTCGTTCAGGCrTTAGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCT 

ATACATCGTAGTGCTGAATTTAATAATATAATTCCTTCATCACAAATTACACAAATACCTTTAACAAAAT 

CTACTAATCTTGGCTCTGGAACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTCTTCGAAG 

AACI^C^CCTGGCCAGATTTCAACCTTAAGAGTAAATATT ACTGCACCATTATCA CA AAG AT ATCGGGTA 

AGAATTCGCTACGCTTCTACCACAAATTTACAATTCCATACATCAATTGACGGAAGACCTATTAATCAGG 

GGAATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGCTTTAGGACTGTAGGTTTTAC 

TACTCCGTTTAACTTTTCAAATGGATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGC 

GAAGTTrATATAGATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAA 

GAGCACAAAAGGCGGTGAATGAGCTGTTTACTTCrTCCAATCAAATCGGGTTAAAAACAGATGTGACGGA 

TTATCATATTGATCAAGTATCCAATTTAGTTG AGTGTTTAT CTG ATG AATTTTGTCTGG ATGAAA AA AA A 

GAATTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATGAGCGGAAXX XX XCCTCG AGCTTGG ATG 

GATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAAT 

CGGCTGCTCTGATGCCGCCGTGTTCXGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGAC 

CTGTCCGGTGCXCTGAATGAACTGCAGGACGAGGC^GCGCGGCrATCGTGGCTGGCCACGACGGGCGTTC 

CTTGCGCAGCTGTGCTCXJACGTTGTCACTGAAGCGGGAAGGGACTGGCT 

GCAGGATCTCCrGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGG 
CTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTA 
CTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCG CG CCAG CCGA 
ACTGTTCGCCAGGCTCAAGGOGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGC 
TTGCCX3AATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGG 
ACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCG 
CTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAG 
TTCTT CTGA CAGATCCCCCGATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGC 
ACTCAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGAAATATTTCTGAATTTAAAC 
TTGCATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACTTGTTATTTTATCAATAAATATTTA 
AACTATATTTCTTTCAAGATGGGAATTAACATCTACAAATTGCCTTTTCTT 

The ATG initiation codon and the TGA stop codon are underlined. 
Figure fo(cont.)
PPS029.SEQ nucleotide 800 - 4000

AAATGGATAAATAGCCTTGCTTCCTATTATATCTTCCCAAATTACCAATACATTACACTAGCATCTGAAT
TTCATAACCAATCTCGATACACCAAATCGGTA^ 

GATATTTCCOTGTCGCTAAOTCAATTOCOTTO 

TAGTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACM3 

AATTAACCAAAG AATAGAAGAATTCGCTAGGAACCAAGCCATTTCT AGATTAGAAGG A CT A AG CA AT 
TATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGATCCTACTAATCCAGCATTAAGAGAAGAGA 
TGCGTATTCAATTCMTGACATGAACAGTGCCCTTACAACCGCTATTCCT 
TCAACTTCCTCTTTTATCAOTATAT^^ 

GTGTTTGGACAAAGGTGGGGATTTGATGCCGCXSACTATCAATAGTCGTTATAATGATTTAACTAGGC 
TTGGCAACTATAC^GATCATGCTGTACGCTGGTACAATACGGGA 

TAGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTAACCCTAACCGTGTTAGACATCGTGTCCCTA 
TTCCCGAACTACGACAGCAGGACGTACCCAATCCGAACCGTGTCCCAGTTAACCAGGGAGATCTACACCA 
ACCCAGTGTTAGAGAACrTOGACGGTAGCTTCOGAGGCrCGGCTCAGGGCATCGAGGGAAGCATCAGGAG 
CCCACACTTGATGGAGATCCIVAACAGGATCACCATCTACACGGACGCTCACAGGGGAG 

TCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCrCGGGGCCAGAATTCACTTTTCCGCTATATGGAA 
CTATG GGAA ATGCAGCTCC^CAACAAOCTATTGTTGCTCAACTA^ 

GTCCACTTTATATAGAAGACCnTrTAATATAGGGATAAATAATCAACAACTATCTGTTCTTGACGGGACA 
GAATTTGCTTATGGAACCTCXn'CAAATTTGCCATCOGCTGTATACAGAAAAAGCGGAACGGTAGATTCGC 
TGGAT GAAAT ACCGCGAGAGAATAACAACGTGCCAOCPAGGCAAGGATTTAGTCATCGATTAAGCCATGT 
TTCAATGTTTCGTTCAGGCTTTAGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGGATA 
CATCGTAGTGCTGAATTTAATAATATAATTCCTTCATCACAAATTACACAAATACCTTTAACAAAATCTA 
CTAATCTTGGCTCTGGAACTTCTGTOGTTAAAGGACCAGGATTTACAGGAGGAGATATTCTTCGAAGAAC 
TTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGC A C CATT ATC A CAAAG ATAT CG GGT AA G A 

^ZTlE??^^^^^^^^^^^^'^'^^^^^^^^^^^^^^'^* C A ATTG A CGGAA6 A CCTATT A ATC AG6G G A 

ATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGCTTTAGGACT 

TCCGTTTAACTTTTCAAATGGATCAAGTGTATTTACGTT AAGTG CTCATGTCTTCAATT CAGGCA ATGAA 

GTTTATATAGATCGAATTGAATTTO1TCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAAGAG 

CACAAAAGGOGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATGT 

TCATATTGATCAAGTATCCAATTrAGTTGAGTGTTTATCTGATGAATTTTGTCTGGATGAAAAAAAAGAA 

TTGTCCGAGAAAGTCAAACATGCGAAGOGACTTAGTGATGAGCGGAAXXXXXCCTCGAGCTTGGA^PGGAT 

TGCACGCAGGTTCTCOGGCCG CTTGGGTGGAGAGGCTATTCGGCTATG ACT GGG C A C A A C A G A C A AT C G G 

CTGCTCTGATGCCGCCGTGTTCCGGCTCT 

TCCGGTGCCCTGAATGAACTGCAGGACGAGGGAGCGCGGCTATCGTGGCTGGCGACGACGGGCGTTCCTT 

GCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGGTGCTATTGGGCGAAGTGCCGGGGCA 

GGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTG 

CATACGC!TPGATCCGGCrACCTGCC»TTCGACCACCAAGCGAA^ 

GGATGGAAGCCGGTCTTGTMATCAGGATGATCTGGACGAAGAGCATCAGGG 

GTTCGCGAGGCTCAAGGCGOGCATGCCCGAC^GGGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTG 
CCGAATATCATGCTCK^AAATGGCCGCTTTTCTGGATOC 

GCTATCAGGACATAGCGTOGGCTACCCGTGATATTGCTrGAAG AG CTTG GCGG CG AAT GGGCT G A CCGCTT 

CCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCT 

^^S^^^^CCCCXXSATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGCACT 

CAGTCTTTCATCTACGGCAATGTACCAGCPGATATAATCAGTTATTGAAATATTTCTGAATTTAAACTTG 

CATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACT 

TATATCTCTTTCAAGATGGGAATTAACATCTACAAATTGCCTTTTCTTA 

GCTCGAATTCCTACGCAGCAGGTCTCATCAAGACGATCTACCCGAGTAACA 

The ATG initiation codon and the TGA stop codon are underlined.
Figure 8a

1 : AatII
2 : AccI
3 : AflII
4 : AlwNI
5 : ApaI
6 : AvrII
7 : BamHI
8 : BclI
9 : BspMII